Delayed Nondeterminism in Continuous-Time Markov Decision Processes
Abstract
Schedulers in randomly timed games can be classified according to whether or not they use timing information. We consider continuous-time Markov decision processes (CTMDPs) and define a hierarchy of positional (P) and history-dependent (H) schedulers which induce strictly tighter bounds on quantitative properties of CTMDPs. This classification into time-abstract (TA), total-time (TT) and fully time-dependent (T) schedulers is mainly based on the kind of timing details that the schedulers may exploit. We investigate when the resolution of nondeterminism may be deferred. In particular, we show that TTP and TAP schedulers allow for delaying nondeterminism for all measures, whereas this holds neither for TP nor for any TAH scheduler. The core of our study is a transformation on CTMDPs which unifies the speed of outgoing transitions per state.
Similar resources
Efficient Computation of Time-Bounded Reachability Probabilities in Uniform Continuous-Time Markov Decision Processes
A continuous-time Markov decision process (CTMDP) is a generalization of a continuous-time Markov chain in which both probabilistic and nondeterministic choices co-exist. This paper presents an efficient algorithm to compute the maximum (or minimum) probability to reach a set of goal states within a given time bound in a uniform CTMDP, i.e., a CTMDP in which the delay time distribution per stat...
On $L_1$-weak ergodicity of nonhomogeneous continuous-time Markov processes
In the present paper we investigate the $L_1$-weak ergodicity of nonhomogeneous continuous-time Markov processes with general state spaces. We provide a necessary and sufficient condition for such processes to satisfy the $L_1$-weak ergodicity. Moreover, we apply the obtained results to establish $L_1$-weak ergodicity of quadratic stochastic processes.
A Characterization of Meaningful Schedulers for Continuous-Time Markov Decision Processes
Continuous-time Markov decision processes are an important variant of labelled transition systems having nondeterminism through labels and stochasticity through exponential fire-time distributions. Nondeterministic choices are resolved using the notion of a scheduler. In this paper we characterize the class of measurable schedulers, which is the most general one, and show how a measurable schedul...
Game Theory in Formal Verification: Scribe Notes for Lecture 8, Markov Decision Processes
1 Definitions. Markov Chain (MC). Definition 1. A Markov chain ((S, E), δ) is a graph (S, E) with a function δ : S → D(S) that maps every state to a probability distribution over successor states. There is an edge between two states s, t in S iff the probability of going from s to ...
Stochastic Nondeterminism and Effectivity Functions
This paper investigates stochastic nondeterminism by relating nondeterministic labelled Markov processes and stochastic effectivity functions to each other; the underlying state spaces are continuous. Both generalizations to labelled Markov transition systems have been proposed recently with differing intentions. It turns out that they display surprising similarities and interesting differences...
Publication date: 2009